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Abstract 

One of the most popular copulas for modeling dependence structures is t-copula. 
Recently the grouped t-copula was generalized to allow each group to have one mem- 
ber only, so that a priori grouping is not required and the dependence modeling is 
more flexible. This paper describes a Markov chain Monte Carlo (MCMC) method 
under the Bayesian inference framework for estimating and choosing t-copula mod- 
els. Using historical data of foreign exchange (FX) rates study, we found 
that Bayesian model choice criteria overwhelmingly favor the generalized t-copula. 
In addition, all the criteria also agree on the second most likely model and these 
inferences are all consistent with classical likelihood ratio tests. Finally, we demon- 
strate the impact of model choice on the conditional Value-at-Risk for portfolios of 
six major FX rates. 

Key words: grouped t— copula, dependence modeling, Bayesian model choice, 
Markov chain Monte Carlo, foreign exchange. 



1 Introduction 



Copula functions have become popular and flexible tools in modeling multivariate de- 
pendence among flnancial risk factors. In practice, one of the most popular copulas in 
modeling multivariate financial data is perhaps the i-copula implied by the multivariate 
^-distribution (hereafter referred to as standard t-copula); see Embrechts et al (2001), 
Fang et al (2002), and Demarta and McNeil (2005). This is due to its simplicity in terms 
of simulation and calibration, combined with its ability to model tail dependence which 
is often observed in financial returns data. Papers by Mashal et al (2003) and Breymann 
et al (2003) have demonstrated that the empirical fit of the standard t-copula is superior 
in most cases when compared to the Gaussian copula. However, the standard i-copula 
is often criticized due to the restriction of having only one parameter for the degrees of 
freedom (dof), which may limit its ability to model tail dependence in multivariate case. 
To overcome this problem, Daul et al (2003) proposed the use of the grouped t-copula, 
where risks are grouped into classes and each class has its own standard t-copula with a 
specific dof. This, however, requires an a priori choice of classes. It is not always obvious 
how the risk factors should be divided into sub-groups. An adequate choice of grouping 
configurations requires substantial additional efi'ort if there is no natural grouping, for 
example, by sector or class of asset. 

Recently, the grouped t-copula was generalized to a new t-copula with multiple dof 
parameters (hereafter referred to as generalized t-copula); see Luo and Shevchenko (2010) 
and Venter et al (2007). This copula can be viewed as a grouped t-copula with each 
group having only one member. It has the advantages of a grouped t-copula with flexible 
modeling of multivariate dependences, yet at the same time it overcomes the difficulties 
with a priori choice of groups. For convenience, denote the new copula as tj^-copula, where 
u — (i^i, i^n) denotes the vector of dof parameters and n is the number of dimensions. 
Luo and Shevchenko (2010) demonstrated that some characteristics of this new copula in 
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the bivariate case are quite different from those of the standard i-copula. For example, 
the copula is not exchangeable if i^i 7^ z/2 and tail dependence implied by the tj^-copula 
depends on both dof parameters. The difference between ti,- and standard i-copulas, in 
terms of impact on Value-at-Risk (VaR) and conditional Value-at-Risk (CVaR) of the 
portfolio, can be significant as demonstrated by simulation experiments for the bivariate 
case. This difference is even much larger than the difference between Gaussian copula 
and the standard i-copula. In examples of maximum hkelihood fitting to USD/AUD 
and USD/JPY daily return data, standard t-copula was statistically rejected by a formal 
Likelihood Ratio test in favour of the i,^ copula (i.e. dof parameters in the fj^-copula were 
statistically different). 

This paper presents a Bayesian model selection study on the t-copula models in the 
multivariate case. We demonstrate how to perform Bayesian inference using Markov chain 
Monte Carlo (MCMC) simulations to estimate parameters and make decisions on model 
choice. Prom a Bayesian point of view, model parameters are random variables whose dis- 
tribution can be inferred by combining the prior density with the likelihood of observed 
data. The complete posterior distribution of the parameters resulting from Bayesian 
MCMC allows further analysis such as model selection and parameter uncertainty quan- 
tification. Specifically, we solve a variable selection problem in the same vein as discussed 
in Cairns (2000). Increasingly, Bayesian MCMC finds new applications in quantitative 
financial risk modeling. Recent examples are found in Peters et al. (2009, 2010) for insur- 
ance, Shevchenko (2010) for operational risk and Luo and Shevchenko (2010) for credit 
risk. 

As a case study, we consider the application of modeling dependence among six major 
foreign exchange (FX) rates (AUD, CAD, CHF, EUR, GBP and JPY, against USD) 
using i-copulas. Following common practice (see e.g. McNeil et al 2005), we use the 
G ARCH (1,1) model to standardize the log- returns of the exchange rates marginally. Then 
the GARCH filtered residuals of the six major FX rates are modeled by a t-copula. In 
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this study we consider altogether 33 competing t-copula models: the standard i-copula, 31 
grouped t-copulas and the generalized t-copula (i.e. tj^-copula). The 31 grouped t-copulas 
are a complete set of all possible combinations of two groups from six FXs (see Table 1 
for all possible 2-group configurations for the six FX majors). 

We present procedures and results of MCMC simulation for t-copula models under 
the Bayesian framework. Also, we demonstrate using Bayesian model inference and ac- 
tual data, that the generalized i-copula (fj^-copula) is convincingly the model of choice 
for modeling dependence between six FX majors, among considered 33 t-copula models. 
Even compared with the best grouped i-copula chosen from 31 possible two-group config- 
urations, the f^-copula is overwhelmingly favoured by the Bayesian factors obtained from 
the MCMC posterior distribution. We demonstrate that the joint calibration of grouped 
i-copula can be done very efficiently by applying MCMC. Using model parameters esti- 
mated from MCMC, we also demonstrate the impact of model choice on CVaR of two 
portfolios of six FX majors. 

The organisation of this paper is as follows. Section 2 introduces the various i-copula 
models and notations. Then it describes the GARCH model filtering for the six FX 
majors, and calibration of the i-copula models using the maximum likelihood method. 
Section 3 discusses the Bayesian inference formulation, the MCMC simulation algorithm, 
the reciprocal importance sampling estimator and the deviance information criterion for 
model selection. Direct computing of the posterior model probability is also discussed 
in Section 3. Section 4 presents MCMC results and the corresponding Bayesian model 
selection, in comparison with the traditional maximum likelihood results and Likelihood 
Ratio tests. Examples of portfolio CVaR calculation using selected models and cahbrated 
parameters are provided in Section 5, demonstrating the impact of model choice on risk 
quantification. Concluding remarks are given in the final section. 
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2 Model, data and maximum likelihood calibration 

It is well known from Sklar's theorem (see Sklar 1959 and Joe 1997) that any joint 
distribution function F with continuous (strictly increasing) margins Fi, F2, . . . , F„ has a 
unique copula 

C(u) = F{Fr\u,), F,-\u2), . . . , F-\un)). (1) 

The i-copulas are most easily described and understood by a stochastic representation, 
as defined below. 

2.1 t-copula models 

We introduce notation and definitions as follows: 

• Z = {Zi, . . . , Zn)' is a random vector from the multivariate normal distribution 
$s(z) with zero mean vector, unit variances and correlation matrix S. 

• U = {Ui, . . . , Un)' is defined on [0, l]"" domain. 

• V is a. random variable from the uniform (0,1) distribution independent of Z. 

• W = G~^{V), where Gu{-) is the distribution function of \fvJS with S distributed 
from the chi-square distribution with v dof, i.e. W and Z are independent. 

• tv{,^ is the standard univariate t-distribution and t~^(-) is its inverse. 

Then we have the following representations. 

Standard i-copula 

The random vector 

X = X Z (2) 

is distributed from a multivariate t-distribution and random vector 

U=(t,(Xi),...,t.(X„))' (3) 
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is distributed from the standard i-copula. 
Grouped t-copula 

Partition {1,2, ... ,n} into m non-overlapping sub-groups of sizes rii, . . . , rim- Then the 
copula of the random vector 

X = {W,Z,, W,Z^„W2Z^,+,, W^Z^.+n,, W^Z^)', (4) 

where Wk = G~^{V), k = 1, . . . ,m, is the grouped t-copula. That is, 

U = . . . , . . . , t^2(-^ni+n2)) • • • ) ^umi^n))' (5) 

is a random vector from the grouped t-copula. Here, the copula for each group is a 
standard i-copula with its own dof parameter (i.e. Uk is dof parameter of the standard 
i-copula for the k-th group). 

Generalized t-copula with multiple dof (fi,-copula) 

Consider the grouped i-copula where each group has a single member. In this case the 
copula of the random vector 

X={WiZ,,W2Z2,...,WnZny (6) 

is said to have a t-copula with multiple dof parameters u = {ui, . . . , 1/^), which we denote 
as fi^-copula. That is, 

\] = {U,{X,),t,,{X,),...,t,SXn))' (7) 

is a random vector distributed according to fj^-copula. Note, all Wi are perfectly depen- 
dent. 

Remark: Given the above stochastic representation, simulation of the i^, copula is 
straightforward. In the case of standard t-copula ui = ■ ■ ■ = Un = v and in the case 
of grouped i-copula the corresponding subsets have the same dof parameter. Note that, 
the standard t-copula and grouped i-copula are special cases of fi,-copula. 
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From the stochastic representation (6-7), it is easy to show that the tj^-copula distri- 
bution has the following explicit integral expression 
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(u) = / ^•E{zi{ui,s),...,Zn{Un,s))ds (8) 

and its density is 







dui . . .du. 





Here: 



- = / ^s(2;i(mi,s),...,2;„(m„,s)) ^rfs/JJ^(xi). (9) 

" i i=i i=i 



• Zi{ui, s) = t^l{ui)/wi{s), i = l,2,...,n; 

. w,is) = G-\sy, 

• V^s(^i5 ■ ■ ■ 5 Zn) = exp(— iz'S^^z)/[(27r)"'/^(detS)^/^] is the multivariate normal den- 
sity; 

• Xi = t-i(Mi), i = l,2,...,n; 

• /^(x) = (1 +xVz/)"^''+^^/^r((z/ + l)/2)/[r(z//2)v^ is the univariate t-density, 
where r(-) is a gamma function. 

The multivariate density 1^ involves a one-dimensional integration which makes the 
density calculation computationally more demanding than in the case of the standard 
t-copula, but still practical using available fast and accurate algorithms for the one- 
dimensional integration. If all the dof parameters are equal, i.e. z/i = ■ ■ ■ = i/„ = z/, 
then it is easy to show that the copula defined by ([8]) becomes the standard t-copula; see 
Luo and Shevchenko (2010) for a proof. 

2.2 FX data and GARCH filtering 

As a case study we consider modeling dependence between six FXs using t-copulas intro- 
duced in previous section. The daily foreign exchange rate data for the six FX majors in 



the period January 2004 to April 2008 (a total of 1092 trading days) were downloaded from 
the Federal Reserve Statistical Release (http://www.federalreserve.gov/releases). These 
daily data have been certified by the Federal Reserve Bank of New York as the noon buy- 
ing rates in New York City. For our purpose, we study the six major currencies (AUD, 
CAD, CHF, EUR, GBP and JPY). Rates were converted to USD per currency unit in the 
present study, if not already in this convention. This unified convention allows a portfolio 
of currencies to be conveniently valued in terms of a single currency, the USD. 

Following common practice (see McNeil et al 2005), we use the GARCH(1,1) model 
to standardize the log-returns of the exchange rates marginally. The G ARCH (1,1) model 
calculates the current squared volatility as 

at + a(xt-i - lif + /5(7t-i, > 0, a, ^ > 0, q; ^ < 1, (10) 

where Xt-i denotes the log- return of an exchange rate on date t — 1. GARCH parameters 
a;, a and /3 are estimated using the maximum likelihood method. Log-return was modeled 

as 

= // + (7t£(*), (11) 

where ji is the average historical return or drift for the asset and is a sequence of 
iid random variables referred to as the residuals. The GARCH filtered residuals of the 
FX rates were then used to fit the i-copula models. Before the fitting the residuals were 
transformed to the (0,1) domain marginally using empirical distributions of the residuals. 

2.3 Configuration of grouped t-copula 

With six dimensions, the grouped t-copula can have a total of 201 possible combinations 
(not counting the standard t-copula and the tjy-copula). In this study we concentrate on 
the class of configurations with two groups only, which is the next level of complexity 
compared with the standard t-copula. This reduces the number of possible grouped t- 
copula models to 31. These 31 grouped t-copula models are: 
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• 10 models from the complete subset of (3,3) configurations (with two groups and 
three members in each group). 

• 15 models from the complete subset of (2,4) configurations (with two members in 
the first group and four members in the second group). 

• 6 models from the complete subset of (1,5) configurations (with one member in the 
first group and five members in the second group). 

Note, a (1,5) combination is the same as a (5,1) combination, and a (2,4) combination 
is the same as a (4,2) combination. So, altogether we have 33 competing models to choose 
from - the standard i-copula, the 31 two-grouped i-copula and the generalized i-copula 
(fy-copula) . 

Table 1 lists all 33 models for modeling the six FX majors, their grouping configurations 
and parameter notations. In column 2 of Table 1, each pair of parentheses define a sub- 
group configuration. The generalized grouped i-copula has six sub-groups with a single 
member in each sub-group, while the standard t-copula has one group containing all six 
members. Note that for the grouped i-copula, exchanging the two sub-groups makes no 
difference - these two configurations have exactly the same combinations of members, so 
no new models will emerge from this exchange. 

2.4 Maximum likelihood calibration 

Consider a random vector of data Y = {Yi, . . . ,1^)'. To estimate a parametric copula 
using observations y*^-'^ j = 1,. . . ,K, where K is the number of observations, the first 
step is to project the data to the [0, 1]" domain to obtain u*--'^ using estimated marginal 
distributions. In our study the margins are modeled using empirical distributions but it 
can also be modeled using parametric distributions or a combination of these methods, 
e.g. empirical distribution for the body and a generalized Pareto distribution for the 
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tail of a marginal distribution (McNeil et al 2005, page 233). Given pseudo sample u^-'^ 
constructed using the original data, the copula parameters can be estimated using, for 
example, the maximum likehood method or MCMC 

Accurate maximum likelihood estimates (MLEs) of the copula parameters should be 
obtained by fitting all unknown parameters jointly. In practice, to simplify the calibration 
procedure, correlation matrix coefficients for t-copulas are often calculated pair-wise using 
Kendall's tau rank correlation coefficients T{Yi, Yj) via the formula (McNeil et al 2005) 

Eij = sin (i7rr(Fi, Yj)) . (12) 

Then in a second stage the dof parameters z/i, . . . , z/„ are estimated. Strictly speaking (fT2ll 
is valid for bivariate case only, however in practice it works well for multivariate case too. 
It was noted in Daul et al (2003) that formula (12) is still highly accurate even when it is 
applied to find the correlation coefficients between risks from the different groups. McNeil 
et al (2005) observed that the estimated parameters using Kendall's tau are identical to 
those obtained by joint estimation to two significant digits, confirming good accuracy 
of the Kendall's tau simplification. It was also observed in Luo and Shevchenko (2010) 
that the difference in estimated parameters between the Kendall's tau approximation and 
the joint estimation was mostly in the third significant digit and was smaller than the 
standard errors for the MLEs. In addition, a study of small sample properties in Luo and 
Shevchenko (2010) showed that the bias introduced by the Kendall's tau approximation 
is very small even for a small sample size of 50. In the present work the data sample size 
is over 1000. The small bias of the Kendall's tau approximation is certainly insignificant 
when compared with the often large difference existing between dof parameters of different 
t-copula models. In other words, using ( fT2l) for the correlation coefficients should cause 
little material difference in the present model choice study where the difference is expected 
to come from different group configurations. 

Because the Kendall's tau approximation is applied pair-wise, we have identical correla- 
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tion matrix for all the copula models to be considered. This simplification is computation- 
ally very significant for the grouped t-copula for which the calibration using density (|9j) 
is computationally demanding. By using the Kendall's tau approximation, the number of 
unknown parameters reduces from M = n{n+ l)/2 to M = n for the generalized grouped 
t-copula. With six-dimensions considered in this study, this amounts to a reduction from 
21 parameters to only 6. For the grouped t-copula with two groups, this reduction is 
from 17 to 2, an even more dramatic reduction. A substantial saving of computing time 
is achieved in both cases. 

Remark: An accurate calibration of grouped t-copula requires joint estimation of dof 
parameters. Sometimes in practice an approximate approach is taken where a grouped 
t-copula is calibrated marginally, i.e. each sub-group is calibrated separately using a 
standard t-copula. This approximation is not always justified; also it can not be applied 
to a generalized t-copula. For a proper and fair comparison between the grouped t-copula 
and the generalized grouped t-copula, in this study we perform joint calibration for both 
copulas. When the grouped t-copula is calibrated jointly, its density is given by the 
integral formula (Q, the same as the generalized t-copula, so a proper joint calibration 
of the grouped t-copula is also computationally demanding when compared with the 
calibration of a standard t-copula. 

Let u be the vector of n dof parameters z/i, . . . , i/„ (the grouped t-copula is treated as a 
special case of tjy-copula). Denote the density of the tjy-copula evaluated at u*^-'^ as Ci^(u*^-')), 
which can be obtained using (Q. Then the MLEs for u are calculated by maximizing the 
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log-likelihood function 

K 

i=i \i *=i 

K n 

j=l i=l 

n 

+K J2 (l H^^^) + ln[r(|j/,)/r(i(z/, + 1))]) , (13) 

i=l 

where xp'' = t~^^{u[^'^), zl''\s) = x^p /wi{s), i = 1, . . . ,n, j = 1, . . . , K. In this work we 
use the double precision IMSL function DQDAGS, a globally adaptive integration scheme 
documented in Piessens et al (1983) for the integration in ([9]). For the maximization of 
(fT3l) the double precision IMSL function DBCPOL is used, which employs a direct search 
Simplex algorithm that does not require calculation of gradients. 



3 Bayesian inference and MCMC 

In this section we describe Bayesian approach and MCMC procedure to estimate t-copulas, 
and model selection criteria used to choose the t-copula model. Under the Bayesian 
approach, the model parameters 6 (in our case is just the dof parameter u) are treated 
as random variables. Given a prior distribution 7r(0) and a conditional density of the data 
given (i.e. likelihood) 7r(y|0), the joint density of data Y and the model parameters 6 
is 7r(y, 0) = Tt{y\6)'K{6). Having observed data Y, the distribution of 6 conditional on 
Y, the posterior distribution, is determined by Bayes' theorem 

The posterior can then be used for predictive inference. There is a large number of useful 
texts on Bayesian inference; for a good introduction, see Berger (1985) and Robert (2001). 
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3.1 MCMC under Bayesian framework 

The explicit evaluation of the normalization constant in (fT4|) is often difficult especially in 
high dimensions. The complexity in our case is evident from the log-likelihood expression 
f|T3|) . The MCMC method provides a highly efficient alternative to traditional techniques 
by sampling from the posterior indirectly and performing the integration implicitly. 

MCMC is especially suited to a Bayesian inference framework. It facilitates the quan- 
tification of parameter uncertainty and model risks. It also allows a unified estimation pro- 
cedure that estimates parameters and latent variables. In the last case a special algorithm 
called data augmentation can be employed, see Tanner and Wong (1987). The Bayesian 
estimates of particular interest from MCMC are the maximum a posterior (MAP) esti- 
mate and the minimum mean square error (MMSE) estimate, defined as follows 

MAP: e'^^^ = aTgmax[7r{0\y)], (15) 

MMSE : e^^^SE ^ (^g^ 

The MAP and MMSE estimates are the posterior mode and mean respectively. If the 
prior 7r(0) is constant and the parameter range includes the MLE, then the MAP of the 
posterior is the same as MLE. 

3.2 Metropolis-Hastings algorithm 

In our case study we use the Metropolis-Hastings algorithm first described by Hastings 
(1970) as a generalization of the Metropolis algorithm (Metropolis et al 1953). Denote the 
state vector at step t as O'^'^^ and we wish to update it to a new state We generate 

a candidate 6* from density q{6\6^^'^)^ and accept this point as the new state of the chain 
with probability given by 

7r(r)g(0W|r) ) 

7r(6>W)g(6>*|6K*)) J ' ^ ' 



a 



'e^'\e*) = min 
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If the proposal is accepted, the new state = 0*, otherwise = 6^^\ The single 

component Metropolis-Hastings is often more efficient in practice. Here the state variable 
0^*) is partitioned into components 0^^^ = {0f\ ^^2*"*; • • • ? ^"^) which are updated one by one 
or block by block. This was the framework for MCMC originally proposed by Metropolis 
et al. (1953), and is adapted in this study. 

The likelihood is computed as Ti{y\6) = exp{iy{6)), where iy{0) is the log-likelihood 
given by f[T5]) . In computer implementation, we take advantage of the fact that only 
one component is updated at each sub-step in the single component Metropolis-Hastings 
algorithm by saving and re-using any values not affected by the current updating. For 
example, each evaluation of f ll3p calls for the inverse of the t-distribution for all the 
data points and all the dof values. Saving and re-using these inverse values reduce the 
calculation by a factor of six for the six- dimensional MCMC computation. 

3.3 Bayesian model selection using MCMC 

Powerful MCMC methods such as the Gibbs sampler (Gelfand and Smith 1990) and 
the Metropolis-Hastings (MH) algorithm (Hastings 1970) enable direct estimation of the 
posterior and predictive quantities of interest, but do not lend themselves readily to 
estimation of the model probabilities. While one of the most common classical techniques 
is the Bayesian Information Criterion (BIC) (Schwarz 1978), many new approaches have 
been suggested in the literature. 

The most widely used methods include the harmonic mean estimator of Newton and 
Raftery (1994), importance sampling (Fruhwirth-Schnatter 1995), the reciprocal impor- 
tance sampling estimator (Gelfand and Dey 1994), and bridge sampling (Meng and Wong 
1996, Fruhwirth-Schnatter 2004). A comprehensive review of some of these methods 
applied to Bayesian model selection can be found in Kass and Raftery (1995). 

Consider model M with parameter vector 6. The model likelihood with data y can be 
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found by integrating out the parameter 

7r(y|M)= f 7c{y\e,M)7r{e\M)de, 



where 7r{0\M) is the prior density of 6 in model M. Given a set of H competing models 
M = (Ml, M2, . . . , Mh), the Bayesian alternative to traditional hypothesis testing is to 
evaluate and compare the posterior probability ratio between the models. For model Mi 
(1 < / < H), assuming we have some prior knowledge about the model probability 7r(M;), 
we can compute the posterior probabilities for all models using the model likelihoods 

TT y = ^ . 19 

Ef=iVr(y|M,)7r(M,) 

Consider two competing models Mi and M2, parameterized by 0[i] and 0[2] respectively. 
The choice between the two models can be based on the posterior model probability ratio, 
given by 

7r(Mi|y) ^ 7r(y|Mi) 7r(Mi) ^ 7r(Mi) 

7r(M2|y) 7r(y|M2) 7r(M2) Tiih'h) ^ ' 

where Bx2 = 7r(y |Mi) / 7r(y IM2) is the Bayes factor, the ratio of posterior odds of model Mi 
to that of model M2. As shown by Lavine and Scherrish (1999), an accurate interpretation 
of the Bayes factor is that the ratio i?i2 captures the change of the odds in favour of model 
Ml as we move from prior to posterior. Jeffreys (1961) recommended a scale of evidence 
for interpreting Bayes factors, which was later modified by Wasserman (1997). A Bayes 
factor B12 > 10 is considered strong evidence in favour of Mi. For a detailed review of 
Bayes factors, see Kass and Raftery (1995). 

Typically, the integral ( ITSjl required by the Bayes factor is not analytically tractable 
and sampling based methods must be used to obtain estimates of the model likelihoods. 
In the current study we choose three methods for model selection: 

• direct estimation of the Bayes factor in f l20|) using Reciprocal Importance Sampling 
Estimation presented in Section 13.3. Ij 
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• deviance information criterion (see Section r3.3.2p : 

• direct computation of the posterior model probabilities using formula presented in 
Section I3.3.3I 

3.3.1 Reciprocal Importance Sampling Estimator 

Given samples 6^*\ t = 1, . . . , N from the posterior distribution obtained through MCMC, 
Gelfand and Dey (1994) proposed the reciprocal importance sampling estimator (RISE) 
to approximate the model likelihood as 



where h plays the role of an importance sampling density roughly matching the posterior. 
Gelfand and Dey (1994) suggested a multivariate normal or t-distribution density with 
mean and covariance fitted to the posterior sample. 

The RISE estimator can be regarded as a generalization of the harmonic mean estima- 
tor suggested by Newton and Raftery (1994). If h = 1 then f l2T|) becomes the harmonic 
mean estimator. Other estimators include the bridge sampling proposed by Meng and 
Wong (1996), and the Chib's candidate's estimator (Chib 1995). In a recent comparison 
study by Miazhynskaia and Dorffner (2006), these estimators were employed as competing 
methods for Bayesian model selection on GARCH-type models, along with the reversible 
jump MCMC. It was demonstrated that the RISE estimator (either with normal or t im- 
portance sampling density), the bridge sampling method and the Chib's algorithm gave 
statistically equal performance in model selection, and their performance more or less 
matches the much more involved reversible jump MCMC. 

3.3.2 Deviance Information Criterion 

The deviance information criterion (DIG) is a generalization of the Bayesian information 
criterion (Schwarz 1978, Spiegelhalter et al 2002). For a given model M (for simplicity 




(21) 
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we drop notation M in the formula below) the deviance is defined as 

D(0) = -21og(7r(y|0)) + C, (22) 

where the constant C is common to all nested models. Then DIG is calculated as 

Die = 2Ee[D{e)] - D{Ee[e]) = Ee[D{e)] + {Eg[D{0)] - D{Ee[e])), (23) 

where Ee[] is the expectation with respect to 6. The expectation Ee[D{0)] is a measure 
of how well the model fits the data; the smaller its value, the better the fit. The difference 
Eg[D{6)] — D{Ee[0\) can be regarded as the effective number of parameters, the larger 
this term, the easier it is for the model to fit the data. So the DIG criterion favours the 
model with a better fit but at the same time penalizes the model with more parameters. 
Under this setting the model with the smallest DIG value is the preferred model. 

3.3.3 Posterior model probabilities 

A popular approach for model choice is based on Reversible Jump MGMG (Green 1995). 
Here we adopt an alternative proposed recently by Peters et al (2009) based on the 
work of Gongdon (2006). In this procedure the posterior model probabilities 7r(Mi|y) are 
estimated using the Markov chain in each model as 



-my) = Y, ^J^' (24) 



where o'^^j^ is the MGMG posterior sample at Markov chain step t for model M/, Ly{Mi, 6^^) 
is the likelihood of y for a given model Mi with parameter vector O^j^^ , and is the total 
number of MGMG steps after burn-in period. In it is assumed that priors 7i{6[i]\Mi) 
and vr(M/) are constant. 
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4 MCMC simulation results and analysis 

Prior distributions. In all MCMC simulation runs, we assume a uniform prior for every 
model parameter. The only subjective judgement we bring to the prior is the support 
of the dof parameter. Denote the k^^ dof parameter of the /i*'^ t-copula model as u^/!^^ 
(see Table 1). We impose a common lower and upper bounds for all dof components, 
specifically 1 = z/min < J^^^^ < '^max = 100 . In our case study the support (1, 100) for 
dof parameter of the t-distribution should be sufficiently large to allow the posterior to 
be implied mainly by the observed data. To make sure the range is sufficiently large, we 
also tested a wider range of (1, 200) and found no material difference in the results. 

MCMC procedure. The starting value for the Markov chain for each component is 
set to a uniform random number drawn independently from the support (z/min, ^max)- 
In the single component Metropolis-Hastings algorithm, we adopt a truncated Gaussian 
distribution as the symmetric random walk proposal density for q{-\-) in ( [T7|) . For each 
component, the mean of the Gaussian density was set to the current state and the variance 
was pre-tuned so that the acceptance rate is close to the optimal level. For d-dimensional 
target distributions with iid components, the asymptotic optimal acceptance rate has 
been reported to be 0.234; see Gelman et al (1997) and Roberts and Rosenthal (2001). 
In pre-tuning the variances for all the components we set 0.234 as the target acceptance 
rate. In addition, the Gaussian density was truncated below z/mm and above i^max to ensure 
each proposal was drawn within the support for the parameters. Specifically, for the k^^ 
component at chain step t, the proposal density is 



where fi\i{-]fi,a) and Fi\f{-; fi,a) are the Gaussian density and distribution functions re- 
spectively, with mean /i and standard deviation a. 

An independent Markov chain was run for each of the 33 models listed in Table 1. 

18 



Each run consists of three stages: 

• Tuning - tune and adjust the proposal standard deviation to achieve optimal accep- 
tance rate for each component. 

• "Burn-in" - samples from this period are discarded. 

• Posterior sampling - here the Markov chian is considered to have converged to the 
stationary target distribution and samples are used for model estimates. 

Unless stated otherwise, we use a "burn-in" period of length A^^ = 20, 000. We then let 
the chain run for an additional N = 100, 000 iterations to generate the posterior samples. 
Each step contained a complete update of all components. 

MCMC convergence. Figure 1 shows the first 30,000 samples, taken after the burn-in 
period, of the dof component for model Mq (i.e. the case of the generalized t-copula). 
Since Mq has the highest parameter dimensions among all the candidate models, in general 
it requires the longest length of chains to converge to a stationary distribution. This figure 
shows that after the burn-in period the samples are mixing well over the support of the 
posterior distribution. 

In addition to inspecting the sample paths, we also monitor the autocorrelation of 
the samples. Figure 2 shows the autocorrelations over multiple lags computed from the 
posterior samples for component vf'' of model Mq. A useful value to compute from these 
autocorrelations for each component is the autocorrelation time defined as 



where Pk{g) is the autocorrelation at lag g for component Ok- This autocorrelation is 
sometimes used to compute an "effective sample size" by dividing the number of samples 
by Tfe. The standard errors for the parameters can then be based on the effective sample 
size to compensate for the autocorrelation (see Ripley 1987, Neal 1993). In practice it 



oo 




(26) 
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is necessary to cut off the sum in fl26p a.t g = g^^^ where the autocorrelations seem to 
have fallen to near zero, because including higher lags adds too much noise (for some 
interesting discussion on this issue, see Kass et al. 1998). As shown in Figure 2, in those 
well mixed MCMC samples the autocorrelation falls to near zero quickly and stays near 
zero at larger lags. For this study we have chosen a g"^'^^ for each component such that 
the autocorrelation at lag g^'^^ has reduced to less than 0.01. That is, the autocorrelation 
time Tk is estimated by 



The fk values estimated from MCMC output for model Mq are shown in Table 2, along 
with the cut-off lag number (7™^^. MCMC convergence characteristics for other compo- 
nents and for other models are very similar to those shown here for model Mq. 

4.1 Bayesian estimates of parameters 

This section presents results for posterior mean (MMSE), mode (MAP) and numerical 
error due to finite number of MCMC iterations. 

4.1.1 Posterior mean and its numerical error 

Table 3 shows values for the estimated mean from MCMC posterior samples for all 33 
models. The standard errors (numerical error due to finite number of MCMC iterations) 
are shown in parentheses and the log-likelihoods corresponding to the estimated means 
are in the last column. Since the samples from MCMC are typically serially correlated, the 
usual formula for estimating the standard error of a sample mean (i.e. standard deviation 
divided by V^) will introduce significant under-estimation. Here, we use hatch sampling 
for the standard error estimate of the MCMC posterior mean; see Gilks et al. (1996). 
Consider a MCMC posterior sample yi,y2, ■■■iUn with length N = Q x L, where L is 




max 




(27) 
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sufficiently large, so that the batch means 

y,^2 E y^'^ ' ? = (28) 

t=(g-l)L+l 

are considered approximately independent. Then y — {yi + ■ — \-yQ)/Q and the standard 
error of the posterior sample mean y can be approximated by 



Note that Q is the number of quasi-independent batches and L — N/Q is the size of each 
batch. 

4.1.2 Posterior mode and likelihood ratio tests 

Values of the posterior mode taken from the MCMC samples for all the 33 models are 
shown in Table 4, along with the corresponding log-hkelihood values. Using results of the 
maximum likelihood corresponding the posterior mode in Table 4, a classical likelihood 
ratio test can be performed to compare model likelihoods. 

Consider the null hypothesis that the observed FX daily return data are from distri- 
bution described by the grouped t-copula model Mi, and the alternative hypothesis that 
the data are distributed according to the generalized t-copula model Mq. The likelihood 
ratio for the two models is simply A = Li/Lq, where Li and Lq are the maximum likeli- 
hood values (i.e. the likelihood value at the mode) for Mi and Mq respectively. The test 
statistic —2 log(A) will be asymptotically distributed with degrees of freedom equal to 
the difference in the number of dof parameters in Mq and Mi, which is 4 in this case. 

We can perform likelihood ratio tests on all other grouped t-copula models (M/j, h = 
2, . . . , 31) against the same alternative hypothesis of model Mq, the generahzed t-copula. 
For the standard t-copula the difference in the number of parameters is 5. The test statistic 
and the associated p-value (x^ significance) are given in Table 4. Clearly, according to 




(29) 
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the p-value, all the null hypotheses should be rejected and the alternative hypothesis, the 
generalized t-copula model Mq, is statistically justified. 

Excluding model Mq, among the other 32 t-copula models (M^, h = 1, . . . ,32), the 
one achieving the highest likelihood is M27, which is one of the six (1,5) two-group config- 
urations. The p-value of this best grouped t-copula model against Mq is 0.0045 which is 
still very small, suggesting a rather strong rejection of the grouped t-copula (including the 
standard t-copula) in favour of the t,y-copula model Mq. Achieving the highest likelihood 
from the fifteen (2,4) configurations is model M4. It is interesting to notice that both M27 
and M4 have three European currencies (CHF, EUR, GBP) in one group (see Table 1), 
perhaps refiecting a natural geopolitical and economic grouping. 

4.2 Bayesian model choice 

While the likelihood ratio test relies on a single point estimate, the Bayesian model choice 
makes decisions based on the entire posterior distribution. As discussed in Section 3.3, 
three Bayesian inference criteria were used to choose among the 33 t-copula models: RISE 
given by (|2T1) : DIG given by fl23l) : and the posterior model probabilities flM|) . 

The RISE calculation involves fitting the MGMG posterior samples to a multivariate 
normal or t-distribution and taking expectation of the reciprocal likelihood. The DIG 
calculation requires taking expectation of the likelihood and the parameters. Golumn 2 
in Table 5 shows the RISE factor Bq^ = Ro/Rh, h = 1, ... ,32, where Rh is the RISE 
value for model Mh. That is, Boh {1 < h < 32) is a measure of strength in the argument 
that the generalized t-copula (model Mq) is the Bayesian choice. The very large Bayes 
factors {Boh > > 5.9 x 10^) shown in Table 5 overwhelmingly support the generalized 
t-copula, confirming the likelihood ratio tests discussed previously. Excluding model Mq, 
these Bayes factors also point to M27 as the most favoured model among the grouped 
t-copulas {Mfi, 1 < h < 32) confirming the likelihood ratio tests. The larger the Bayes 
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factor B(jh, the stronger the gainst model (1 < h < 32). 

The DIG values for 33 models are shown in column 3 of Table 5. Since only relative 
DIG value matters, the common constant in (!22l) was set in such a way that the DIG 
value for model Mq is zero. As shown in Table 5, the DIG value for all the other models is 
significantly positive, relative to that of Mq. Thus under the DIG criterion the model of 
choice is clearly Mq, i.e. the generalized t-copula. In addition, similar to the RISE based 
Bayes factors and the likelihood tests, the DIG values also pick M27 as the most likely 
grouped t-copula model after Mq, the same as the RISE factor and the likelihood ratio 
test. The larger the DIG value, the stronger the case against the model. It is interesting to 
observe that the magnitude of the DIG value is close to that of the logarithm of the Bayes 
factor based on the reciprocal importance sampling estimator, when both are evaluated 
relative to the same model Mq. 

As shown by column 4 in Table 5, the results for posterior model probabilities f l24l) 
also agree with the RISE and DIG results, i.e. model Mq has a very high probability of 
88%, and model M27 has the second highest probability. If we exclude model Mq, then 
model M27 has a high probability of 68%. In summary, all three Bayesian choice criteria 
point to the same model Mq as the best choice followed by model M27, and these choices 
are in agreement with the classical likelihood ratio tests, as shown in Table 4. 

5 Conditional Value-at-Risk 

Gonsider a portfolio of six major currencies. Denote the exchange rates (USD per currency 
unit) for these currencies at time t by \ i = 1, . . . ,Q. Assume we hold Aj units for the 
j^th currency. The portfolio value at time t is then K^*) = Yl^=i ^iSf^ ■ The log-return for 
the i^^ currency at time t + 1 is given by xf = In S^^^^'^ — In Sf^ . The portfolio loss for 
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one time step is then 

- = y W - = ^A,5f (l-exp(xf+^))) 

1=1 

= yW^t..(l-exp(xr))), (30) 

where Wj = AjS'j-*^/!^^*^ is the proportion of the portfoho value in currency i at time t, that 
is, it is the dollar weight of the i*'* currency. Now we wish to simulate the distribution of 
portfolio return 

i=l i=l 

In the present study we model the dependence of the log-returns x^l^^^ by one of the 
t-copula models as described in the previous sections. Recall that the dof parameters and 
their posterior distributions are already obtained by Bayesian MCMC. To focus on the 
impact of copula models, we use the standard normal distribution for all the six marginals. 
We take the CVaR as our risk measure. Assume that a random variable Z has continuous 
density /(•) and distribution -F(-). Given a threshold quantile level a, the CVaR above 
F~^[a) is defined as 

oo 

CVaRa[Z] = E[Z\Z>F-\a)] = [ xf{x)dx, (31) 

I - a J 

F-i(a) 

which is the average of the losses exceeding F~^{a). To demonstrate model impact on 
risk quantification, we compare CVaR of the two most likely models, Mq and M27, the 
best and the second best models of all 33 candidates. CVaR is calculated numerically 
using 10^ Monte Carlo simulations with i-copula model parameters given in Table (4). 

Table 6 shows CVaR^^gg and CVaRQ^l^^ predicted by models Mq and M27 for two 
portfolios (defined by weights in Table 6). Note in both portfolios we have negative 
weights (seUing the currency) and the weights in each portfolio add to 1.0. As shown 
in Table 6, model M27 underestimates the 0.99 CVaR by 16% for the first portfolio, and 

24 



this underestimate reverses to a slight overestimate for the second portfoho, assuming 
the correct estimates are from model Mq. The second portfoho is only slightly different 
from the first - swapping the position of EUR and CHF (long/short position) in the 
first portfoho yields the second portfolio. The two portfohos are deliberately chosen to 
demonstrate that the model impact on risk quantification can be in either direction - it 
may be overestimation or it may be underestimation, depending on the portfolio. Thus it 
is important to choose the most suitable model statistically, such as by means of Bayesian 
model inference. Table 7 compares 0.99 CVaR prediction of model Mq with that of model 
M4, the most hkely model from the (3,3) configuration, for the same two portfohos as 
those in Table 6. Here again the 0.99 CVaR for the first portfoho is underestimated by 
the incorrect model, and for the second portfolio it is overestimated. 

6 Conclusion 

This paper describes a Bayesian model choice methodology for i-copula models. As an 
illustration, altogether 33 i-copula models of six dimensions were considered: the general- 
ized t-copula; the standard t-copula; and 31 grouped t-copula models from the complete 
subset of (3,3), (2,4) and (1,5) configurations. MCMC simulations under a Bayesian in- 
ference framework were performed to obtain the posterior distribution of dof parameters 
for all 33 t-copula models. Using historical data of foreign exchange rates as a case study, 
we found that Bayesian model choice based on the RISE, the DIG and the posterior model 
probabilities overwhelmingly favors the generalized t-copula model Mq. In addition, all 
three Bayesian choice criteria point to the same second most likely model M27. These 
Bayesian choices are also in agreement with classical hkelihood ratio tests. 

The impact of model choice on the CVaR for two portfohos of six FX majors was 
observed to be significant. 

For a comprehensive modeling of multivariate dependence in finance or insurance, there 
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are other issues in data analysis that should be addressed carefully, such as time-dependent 
correlation parameters and validation. These are not considered in the present study. 
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Figure 1: Markov chain paths for parameter i>[^^ of model Mq 
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Figure 2: Autocorrelation of Markov chain samples for dof parameter of model Mq 
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Table 1: Group configurations and parameters for the 33 t-copula models. 
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Table 2: Autocorrelation estimates and corresponding cut-off lag number. 
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Table 3: MCMC output values of posterior mean, standard error and log-likelihood. 



Model Posterior Mean (Standard Error) Log-likelihood 
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2338.6 


71 /T 

M3 


(3) 


= 18.2 (0.28), 


(3) 
^2 


= 8.56 (0.09) 






/I 1 r\ 

2341.9 


71 /T 

M4 


(4) 


= 24.4 (0.82), 


(4) 
Z/^ = 


= 7.78 (0.16) 






2343.8 


71 /T 

M5 


(5) 


= 8.51 (0.13), 


(5) 
Z/^ = 


= 18.7 (0.49) 






2341.9 


71 /T 

Mq 


(6) 


= 13.4 (0.26), 


(6) 


= 10.2 (0.16) 






0000 

2338.6 


M7 


(7) 


= 13.6 (0.27), 


(7) 
^ = 


= 10.3 (0.17) 






2338.6 




(8) 


= 9.24(0.13), 


(8) 
^2 = 


: 15.7 (0.35) 






2343.2 


Mg 


(9) 


= 8.76(0.13), 


(9) 
Z/^ = 


: 14.0 (0.32) 






2343.2 


71 /f" 

Mio 


(10) 


— 12.6(0.23), 


(10) 
^2 


-1-1 -1 / -| 

= 11.1 (0.17) 






2336.7 


71 /f" 

Mil 


(11) 


— 27.9(4.87), 


(11) 
^2 


Ci /~' ( r\ r\f\\ 

— 8.6 (0.09) 






2336.7 


71 /f" 

M12 


(12) 


1 A t ( r\ A a\ 

— 14.1(0.44), 


(12) 
i^2 


— 10.6 (0.14) 






2336.7 


71 /T 

Mi3 


(13) 


= 8.56(0.11), 


(13) 
^^2 


— 13.8 (0.25) 






2343.6 


71 /f" 

Mi4 


(14) 


— 13.2(0.74), 


(14) 
i^2 


= 11.1 (0.14) 






2336.7 


71 /T 

Mi5 


(15) 
^1 


= 13.3(0.94), 


(15) 
i^2 


= 11.2 (0.17) 






2336.6 


71 /T 

M16 


(16) 
^1 


— 17.4(0.85), 


(16) 
^^2 


= 9.78 (0.13) 






2343.0 


71 /T 

Mi7 


(17) 


— 9.78(0.14), 


(17) 
^2 


— 12.9 (0.2) 






2338.9 


71 /f" 

M18 


(18) 


C\ A C\ f A A\ 

— 24.9(4.4), 


(18) 


c\f^ f f\ r\c\\ 

= 8.96 (0.09) 






2342.5 


71 /f" 

Mi9 


(19) 


= 32.5(7.99), 


(19) 
^2 


£*1 IT/^ f f\ f\Ci\ 

= 8.76 (0.08) 






2343.4 


71 /T 

M20 


(20) 


= 7.46(0.11), 


(20) 
^^2 


= 16.8 (0.59) 






2343.0 


71 /f 

M21 


(21) 


= 14.2(0.43), 


(21) 
^2 


= 10.5 (0.14) 






2338.6 


71 yT 

M22 


(22) 
I/} 


= 14.3(0.49), 


(22) 
^2 


ir\ r" { c\ ir'N 

= 10.5 (0.15) 






0000 /< 

2338.4 


71 J" 

M23 


(23) 
I/} 


= 8.64(0.11), 


(23) 
^2 


1/1 ( c\ ^cW 

= 14.2 (0.32) 






00 /I 

2343.7 


71 ^ 

M24 


(24) 


n n ( r\ r\c\\ 

= 8.66(0.09), 


(24) 
^^2 


= 13.6 (0.23) 






00 /I /I 

2343.4 


71 ,T 

M25 


(25) 
I/} 


= 12.8(0.50), 


(25) 

7/ 

^^2 


11 iTx { r\ io\ 

= 11.2 (0.13) 






2336.7 


71 T 

M26 


(26) 


= 15.2(1.63), 


(26) 
^^2 


11 //^Ol\ 

= 11.3 (0.31) 






000/^ /I 

2336.4 


iv:z27 


^(27) 
'^l 


— 64 7('3 01 


^^2 


— q 9fi (n 0A\ 






2346 7 


M28 


(28) 


= 16.0(1.15), 


^(28) 
'^2 


= 11.0 (0.38) 






2338.4 


M29 


(29) 


= 7.94(0.32), 


^(29) 
^^2 


= 12.9 (0.48) 






2344.4 


M30 


(30) 


= 15.1(1.53), 


^(30) 
1^2 


= 11.3 (0.32) 






2336.5 


M31 


(31) 


= 11.4(0.32), 


^(31) 
^^2 


= 1.81 (0.17) 






2336.4 


M32 


(32) 

z/} 


= 11.4 (0.14) 










2336.8 
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Table 4: MCMC output values of posterior mode, corresponding log-likelihood, likelihood 
ratio A and p- value comparing with Mq. 



Model MCMC posterior mode Log-likelihood —2 log(A) p-value 



Mo 


1/(0) = 


11.5, S 


2.4,7.92,5.81,10.3,14.3) 


2354.3 





N/A 


Ml 


1/(1) = 


14.0, 


8.96) 


2342.9 


22.8 


0.00014 


M2 


i/(2) = 


;9.75, 


12.6) 


2338.7 


31.2 


<0.00001 


M3 


r/(3) = 


16.6, 


8.22) 


2342.1 


24.4 


<0.0001 


M4 


1/(4) = 


[21.0, 


7.49) 


2344.1 


20.4 


0.00042 


Ms 


J/(5) = 


;8.i7. 


16.9) 


2342.1 


24.4 


<0.0001 


Me 


J/(6) = 


12.6, 


9.73) 


2338.8 


31.0 


<0.00001 


M7 


r/(7) = 


12.8, 


9.79) 


2338.7 


31.2 


<0.00001 


Ms 


Z/(8) = 


;8.84, 


14.4) 


2343.4 


21.8 


0.00022 


Mg 


1/(9) = 


;8.76, 


14.0) 


2343.3 


22.0 


0.00020 


Mio 


1,(10) = 


(11.8, 


10.5) 


2336.9 


34.8 


<0.000001 


Mn 


1/(11) _ 


(22.6, 


8.68) 


2343.0 


22.6 


0.00015 


M12 


1/(12) _ 


(13.1, 


10.2) 


2338.6 


31.5 


<0.00001 


Mi3 


1/(13) = 


(8.23, 


13.1) 


2343.7 


21.3 


0.00028 


Mi4 


1/(14) _ 


(11.9, 


10.7) 


2336.9 


34.8 


<0.000001 




1/(15) = 


(11.7, 


10.8) 


2336.8 


34.9 


<0.000001 


M16 


1/(16) = 


(15.7, 


9.39) 


2343.2 


22.2 


0.00018 


Mi7 


1/(17) = 


(9.31, 


12.7) 


2339.0 


30.6 


<0.00001 


M18 


1/(18) = 


(20.9, 


8.66) 


2342.7 


23.1 


0.00012 


Mi9 


1/(19) = 


(25.2, 


8.48) 


2343.8 


21.1 


0.00030 


M20 


1/(20) ^ 


(7.14, 


15.7) 


2343.1 


22.3 


0.00017 


M21 


1/(21) ^ 


(13.2, 


10.1) 


2338.5 


31.5 


<0.00001 


M22 


1/(22) ^ 


(13.3, 


10.1) 


2338.5 


31.5 


<0.00001 


M23 


1/(23) ^ 


(8.27, 


13.4) 


2343.8 


21.0 


0.00031 


M24 


1/(24) ^ 


(8.31, 


12.9) 


2343.5 


21.7 


0.00023 


M25 


i/(25) = 


(11.7, 


10.8) 


2336.8 


34.9 


<0.000001 


M26 


1/(26) = 


(11.8, 


10.9) 


2336.8 


34.9 


<0.000001 


M27 


J/(27) = 


(68.3, 


9.03) 


2346.8 


15.1 


0.0045 


M28 


J/(28) = 


(14.0, 


10.5) 


2338.6 


31.4 


<0.00001 


M29 


J/(29) = 


(7.55, 


12.2) 


2344.5 


19.6 


0.00060 


M30 


1/(30) ^ 


(12.0, 


10.9) 


2336.8 


34.9 


<0.000001 


M31 


i/(31) = 


(11.4, 


11.0) 


2336.8 


35.0 


<0.000001 


M32 


1/(32) ^ 


(11.1) 




2336.9 


34.9 


<0.00001 
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Table 5: Bayes factors Bqi, DIG and model probabilities of all candidates. 



Model 


Log{Boh) 


DIG 


Model prob. (%) 


Model prob. (%) excl. Mq 


Mo 








88.5 


N/A 


Ml 


16.7 


15.8 


0.15 


1.28 


M2 


20.1 


24.0 


< 0.1 


< 0.1 


M3 


16.6 


17.5 


< 0.1 


0.54 


M4 


16.4 


13.0 


0.45 


3.89 


Ms 


18.7 


17.4 


< 0.1 


0.55 


Me 


19.3 


24.1 


< 0.1 


< 0.1 


My 


21.5 


24.3 


< 0.1 


< 0.1 


Mg 


14.5 


15.0 


0.12 


1.07 


Mg 


18.9 


15.1 


0.22 


1.89 


Mio 


21.9 


27.8 


< 0.1 


< 0.1 


Mn 


18.0 


15.6 


0.15 


1.26 


M12 


24.9 


24.7 


< 0.1 


< 0.1 


Mi3 


14.9 


14.3 


0.31 


2.68 


Mi4 


23.2 


27.9 


< 0.1 


< 0.1 


Mi5 


25.5 


28.0 


< 0.1 


< 0.1 


M16 


14.2 


15.3 


0.20 


1.73 


Mi7 


21.7 


23.7 


< 0.1 


< 0.1 


M18 


15.3 


16.1 


0.12 


1.03 


Mi9 


15.1 


14.1 


0.29 


2.52 


M20 


15.1 


15.3 


0.19 


1.64 


M21 


21.5 


24.3 


< 0.1 


< 0.1 


M22 


20.6 


24.6 


< 0.1 


< 0.1 


M23 


17.1 


14.1 


0.36 


3.10 


M24 


15.4 


14.7 


0.25 


2.22 


M25 


22.6 


27.9 


< 0.1 


< 0.1 


M26 


22.2 


28.0 


< 0.1 


< 0.1 


M27 


11.0 


6.73 


7.8 


68.3 


M28 


21.0 


24.6 


< 0.1 


< 0.1 


M29 


16.3 


12.7 


0.7 


6.18 


M30 


28.0 


28.0 


< 0.1 


< 0.1 


M31 


22.8 


28.1 


< 0.1 


< 0.1 




22.3 


28.1 


< 0.1 


< 0.1 
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Table 6: The 0.99 conditional Value-at-Risk {CVaRo.99) predicted by model Mq and M27, 
for two portfolios of six major currencies. 5 — (CVaR^gg — CVaR^gg)/CVaR^gg is the 
relative difference of CVaR between M27 and Mp. Standard errors are in parentheses. 



Portfolio asset weights Wi 

(ai:d. cad. chf. ei;r. gbp. jpy) 




CVaR^^,f 


S 


(0.25, 0.25, 0.8, -0.8, 0.25, 0.25) 


1.707 (0.004) 


1.425 (0.003) 


-16.5% 


(0.25, 0.25, -0.8, 0.8, 0.25, 0.25) 


1.737 (0.003) 


1.782 (0.003) 


2.6% 



Table 7: The 0.99 conditional Value-at-Risk (CVaRo^gg) predicted by model Mq and M4, 
for two portfohos of six major currencies. 5 — (CVaR^^gg — CVaR^gg)/CVaR^gg is the 
relative difference of CVaR between M4 and Mp. Standard errors are in parentheses. 



Portfolio asset weights Wi 
(AUD, CAD, CHF, EUR, GBP, JPY) 




CVaR^Q^g^ 


S 


(0.25, 0.25, 0.8, -0.8, 0.25, 0.25) 


1.571 (0.004) 


1.366 (0.003) 


-13.0% 


(0.25, 0.25, -0.8, 0.8, 0.25, 0.25) 


1.608 (0.003) 


1.732 (0.003) 


7.7% 
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